Receptive Field Block Net for Accurate and Fast Object Detection

Authors

  • Songtao Liu
  • Di Huang
  • Yunhong Wang
Abstract

Current top-performing object detectors depend on deep CNN backbones, such as ResNet-101 and Inception, benefiting from their powerful feature representation but suffering from high computational cost. Conversely, some detectors based on lightweight models achieve real-time processing, but their accuracy is often criticized. In this paper, we explore an alternative: building a fast and accurate detector by strengthening lightweight features using a crafting mechanism. Inspired by the structure of Receptive Fields (RFs) in human visual systems, we propose a novel RF Block (RFB) module, which takes the relationship between the size and eccentricity of RFs into account, to enhance the discriminability and robustness of features. We further assemble the RFB module on top of SSD with a lightweight CNN model, constructing the RFB Net detector. To evaluate its effectiveness, experiments are conducted on two major benchmarks, and the results show that RFB Net reaches the accuracy of advanced detectors based on very deep backbone networks while keeping real-time speed. Code is available at https://github.com/ruinmessi/RFBNet.

Similar resources

Novelty detection in image recognition using IRF Neural Networks properties

Image Receptive Fields Neural Network (IRF-NN) is a variant of feedforward multi-layer perceptrons adapted to image recognition. It shows very fast training as well as robust and accurate results on supervised classification tasks. This paper presents another property of IRF-NN: responses of trained networks can be analysed to detect unknown images. Several discriminative and efficient novelty ...

Pii: S0306-4522(98)00620-4

Optokinetic nystagmus is a reflex to stabilize an object image on the retina by compensatory eye movements. In lower vertebrates, the nucleus of the basal optic root participates in generating this reflex. Visual responses of 135 neurons were extracellularly recorded from the nucleus in pigeons and their receptive field properties were analysed on-line with a workstation. These cells could be c...

Object-based Postprocessing of Block Motion Fields for Video

It is likely that in many applications block-matching techniques for motion estimation will be further used. In this paper, a novel object-based approach for enhancement of motion fields generated by block matching is proposed. Herein, a block matching is first applied in parallel with a fast spatial image segmentation. Then, a rule-based object postprocessing strategy is used where each object is ...

Efficient Video Indexing for Fast-motion Video

Due to advances in recent multimedia technologies, various digital video contents become available from different multimedia sources. Efficient management, storage, coding, and indexing of video are required because video contains lots of visual information and requires a large amount of memory. This paper proposes an efficient video indexing method for video with rapid motion or fast illuminat...

Do Convnets Learn Correspondence?

Convolutional neural nets (convnets) trained from massive labeled datasets [1] have substantially improved the state-of-the-art in image classification [2] and object detection [3]. However, visual understanding requires establishing correspondence on a finer level than object category. Given their large pooling regions and training from whole-image labels, it is not clear that convnets derive ...

Journal title:
  • CoRR

Volume abs/1711.07767  Issue 

Pages  -

Publication date 2017